Introduction to IND and recursive partitioning, version 1.0

Authors: Buntine Wray; Caruana Rich

This manual describes the IND package for learning tree classifiers from data. The package is an integrated C and C shell re-implementation of tree learning routines such as CART, C4, and various MDL and Bayesian variations. The package includes routines for experiment control, interactive operation, and analysis of tree building. The manual introduces the system and its many options, gives a basic review of tree learning, contains a guide to the literature and a glossary, lists the manual pages for the routines, and provides instructions on installation.

NASA Technical Reports Server

Introduction to IND and recursive partitioning

Authors: Buntine Wray; Caruana Rich

This manual describes the IND package for learning tree classifiers from data. The package is an integrated C and C shell re-implementation of tree learning routines such as CART, C4, and various MDL and Bayesian variations. The package includes routines for experiment control, interactive operation, and analysis of tree building. The manual introduces the system and its many options, gives a basic review of tree learning, contains a guide to the literature and a glossary, lists the manual pages for the routines, and provides instructions on installation.

NASA Technical Reports Server

Multitask Evolution with Cartesian Genetic Programming

Authors: Caruana Rich; Dorigo Marco; Miller Julian F
Publication date: 24/04/2017

We introduce a genetic programming method for solving multiple Boolean circuit synthesis tasks simultaneously. This allows us to solve a set of elementary logic functions twice as easily as with a direct, single-task approach. Comment: 2 page

arXiv.org e-Print Archive

Crossref

Axiomatic Interpretability for Multiclass Additive Models

Authors: Caruana Rich; Chajewska Urszula; Koch Paul; Lou Yin; Tan Sarah; Zhang Xuezhou
Publication date: 30/05/2019

Generalized additive models (GAMs) are favored in many regression and binary classification problems because they can fit complex, nonlinear functions while remaining interpretable. In the first part of this paper, we generalize a state-of-the-art GAM learning algorithm based on boosted trees to the multiclass setting, and show that this multiclass algorithm outperforms existing GAM learning algorithms and sometimes matches the performance of full complexity models such as gradient boosted trees. In the second part, we turn our attention to the interpretability of GAMs in the multiclass setting. Surprisingly, the natural interpretability of GAMs breaks down when there are more than two classes. Naive interpretation of multiclass GAMs can lead to false conclusions. Inspired by binary GAMs, we identify two axioms that any additive model must satisfy in order to not be visually misleading. We then develop a technique called Additive Post-Processing for Interpretability (API) that provably transforms a pre-trained additive model to satisfy the interpretability axioms without sacrificing accuracy. The technique works not just on models trained with our learning algorithm, but on any multiclass additive model, including multiclass linear and logistic regression. We demonstrate the effectiveness of API on a 12-class infant mortality dataset. Comment: KDD 201

arXiv.org e-Print Archive

Crossref